Measurement of CP violation in B s — > J/ip<p decay 
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We briefly discuss measurements of CP violation in B s — > J/ip(j) decay. Both the phenomenology of 
B s mixing and the importance of the measurement to searches for new physics, as well as technical 
details and issues with the analysis are included. While current results are consistent with the 
standard model, even large contributions from new physics cannot be excluded. 
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INTRODUCTION 



6-physics dates back to 1964 when the decay of the 
long lived kaon to two pions, and thus CP violation was 
observed [1]. It did not took very long until a proposal 
for the theoretical explanation of CP violation was made. 
In their famous work, Kobayashi and Maskawa showed 
that with 4 quarks there is no reasonable way to include 
CP violation [2]. Together with this, they also proposed 
several models to explain the observed CP violation in 
the kaon system, amongst which the six quark model 
became favored over time. 

The explanation of CP violation in the six-quark model 
of Kobayashi and Maskawa builds on the idea of quark 
mixing introduced by Cabibbo. The quark mixing intro- 
duces a difference between eigenstates of the strong and 
weak interactions. CP violation requires a complex phase 
in order to provide a difference between a process and its 
charge conjugate. In the four-quark model, the quark 
mixing is described by a 2 x 2 unitarity matrix. With 
only four quarks, states can be always re-phased in order 
to keep the mixing matrix real and thus quark mixing 
cannot accommodate the observed CP violation. With 
the extension to six quarks, the mixing matrix becomes 
a 3 x 3 unitarity matrix, called the Cabibbo-Kobayashi- 
Maskawa matrix, Vckm- In this case there is no possi- 
bility to rotate away all phases and one complex phase 
always remains in the matrix. This complex phase of 
Vckm provides the CP violation in the standard model. 
This idea had two important implications. First, in ad- 
dition to the three quarks known in the early 1970's and 
the predicted charm quark, it postulates the existence of 
two additional quarks, called bottom and top. Second, 
despite the tiny CP violation in the kaon system, the 
proposed mechanism implies large CP violation in the B° 
system. It took almost three decades, but both predic- 
tions were experimentally confirmed, first by the discov- 
ery of the bottom quark in 1977 [3] followed by that of the 
top quark in 1995 [4, 5] and finally by the measurement 
of large CP violation in the B° system in 2001 [6, 7]. The 
observation of large CP violation in B° decays confirmed 
the Kobayashi-Maskawa mechanism as the way to gener- 
ate the CP violation in the standard model, this resulting 



in the 2008 Nobel prize for Kobayashi and Maskawa. 

After confirmation of the standard model, the focus 
shifted to the search for new physics. One of the most 
promising processes is mixing, governed by the CKM 
matrix element Vt s . Indirect information suggests Vt s 
to be almost real, which translates to the fact that the 
CP violation due to the Bg mixing is expected to be 
tiny in the standard model. Amongst possible ways of 
accessing new physics in the B s mixing, the measurement 
of CP violation in B s — > J/ip<f> decay is considered as a 
golden way. In this paper we review the current status 
of the existing measurements in the B s — > J/4>4> decay, as 
well as the issues connected to the measurement itself. 
We will omit here many details in favour of providing a 
comparison of measurements from different experiments 
and discussing features which are not always emphasised. 
Details of separate measurements are available for CDF 
in Refs. [8-10], for D0 in Refs. [11, 12] and for LHCb in 
Ref. [13]. 

This paper is organized as follows: In section II, we 
briefly discuss the phenomenology of B" oscillations. An 
overview of the basic components of the analysis is given 
in section III. In sections IV and V, we discuss in more 
detail, two main components, namely the determination 
of whether B° was produced as B° or B" and the maxi- 
mum likelihood fit. Section VI deals with the statistical 
issues encountered in the measurements. The results are 
given in section VII and the paper concludes with future 
prospects in section VIII. 

Note on particle naming. As this topic requires the 
distinction between the use of both particle and anti- 
particle and the relevant single-flavour cases, we adopt a 
notation in which B s denotes both particle B° and anti- 
particle B°, while the use of B s ° or B° denotes a given 
flavour. 
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II. PHENOMENOLOGY OF THE B° SYSTEM 

The time evolution of the B° system is described by 
the Schrodinger equation 



. d_ ( |B°(t)> \ 

l dt { ib°(*)> ; 



M 



if 



|B°(t)> 
|B?(*)> 



where the 2x2 matrices M and f describe the masses and 
decay rates. Diagonalization leads to eigenstates with 
definite masses and lifetimes 
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P |B S °) 
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with p and q being complex numbers satisfying \p\ 2 + 
\q\ 2 = 1. The states B^) and |B^} have distinct masses 
Mh and Ml and distinct decay widths Th and IY- The 
B° mixing Feynman diagrams responsible for the transi- 
tion of B" to B° and vice versa give rise to the off-diagonal 
elements Mi 2 and Ti2. The B s mixing observables are 
defined to be the mass difference between |B^) and B^) 
states 



AM = M H - M h = 2\M 12 \ 
and the decay width difference 

Ar = r L -rH = 2|r 12 |cos(/> 



(1) 



(2) 



where <f> s — arg(— M12/T12). The standard model pre- 
dictions [14, 15] are AAf = (17.3 ± 2.6) ps" 1 , <j) s = 
(0.22 ±0.06)° and Ar = (0.087 ± 0.021) ps" 1 . Physics 
beyond the standard model can alter the picture by af- 
fecting both M12 and Ti2. In practice most of the 
models of new physics consider only changes in |M 12 | 
and c/) s , and leave |ri 2 | unaffected. The B° mixing fre- 
quency AM is measured most precisely by the CDF ex- 
periment [16] as AM = 17.77 ± 0.10 ± 0.07 ps~\ and 
preliminary with similar precision also by LHCb [17] as 
AM = 17.63 ± 0.11 ± 0.04 ps" 1 . Since AM is pre- 
cisely known and consistent with the standard model ex- 
pectation, new contributions to |Mi 2 | are strongly con- 
strained. At the same time, Ti 2 is dominated by the tree 
level b — > ccs transition and therefore it is non-trivial to 
construct a model which would affect | Ti2 1 significantly 
while at the same time avoiding constraints from existing 
measurements. The phase <f) s was experimentally uncon- 
strained until recently, which made it the prime candi- 
date for searches for new physics. 

The B s mixing phase <j> a can be accessed experimen- 
tally by measurements of the CP asymmetry in flavour- 
specific decays or by CP violation due to an interference 
between decays with and without mixing. The flavour- 
specific CP asymmetry is defined as 



12I 



Mi 



sm(<f) s ). 



(3) 



The challenge of the a f s measurement is in the smallness 
of the effect. If we assume no new physics contribution 
to |Ti2 1 , then the maximum effect would be typically at 
most 5 x 10~ 3 and would not reach values above 10~ 2 
even in the most optimistic scenario. 

In the second type of measurement, one exploits a final 
state which is common to both Bj? and Bj?. In such a case 
an interference between the amplitude for a direct decay 
of Bg to a given final state and the amplitude for B^ 

oscillating to Bj? which then decays to the final state gives 
rise to a time-dependent CP violation. The decay B s — > 
J/i^(f> which we are going to discuss in this review belongs 
to this category. A complication arises from the fact that 
this type of CP violation does not measure directly the <p s 
but rather the relative phase between (f> s and the phase of 
the decay. For the decay B s — > J/i/)<f> the standard model 
expectation [18, 19] for the CP-violating phase is 



f% M = arg 



v C sV; b 



0.02. 



(4) 



As mentioned before, the current focus is on the search 
for new physics. If new physics contributes to the B s mix- 
ing process, it can modify the phase between M12 and Ti 2 



to 



ASM 



H.NP 



where 



H.NP 



is a new physics contri- 



bution. This new phase will then also be reflected in the 
measurement of B s — > J/ip<j> decay where the observable 
phase can be expressed as 



M4> 



-2/?f M + ^ P - 



(5) 



Here we have neglected possible higher order corrections 
to the decay both in the standard model as well as in new 
physics models. With the current generation of measure- 
ments the sensitivity is not sufficient to reach standard 
model values, but it is possible to search for large new 
physics contributions. While precise predictions for new 
physics models are not easily available, large effects are 
possible even after taking into account all existing con- 
straints on possible new physics models. It is also worth 
noting that in many models of new physics, there is a 
correlation between the CP violation in B s mixing and 
other flavour physics observables which can help to dis- 
tinguish different new physics scenarios. For examples of 
a new physics discussion see Refs. [20-27]. 

Finally one remark on the notation which differs be- 
tween experiments. Different symbols are used to denote 
the CP violation in B s — > 3/ipcf) decay and it is not al- 
ways made fully clear how they are defined. On the other 
hand, given the precision of the existing measurements, 
the standard model contributions to the various phases 
can be neglected and we can interpret results as con- 
straints on new physics contributions to the B s mixing 

phase (p s . In the following we will use the symbol (j)^^ to 
denote the phase measured by experiment in B s — > 3/tp(j) 

decay. This relates to S used by CDF as ^> 3 J^^ = —2(3 S 
and is the same as the 4> s used by LHCb. In their latest 
documents, DO uses the symbol </>l^ . 
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III. MEASUREMENT OVERVIEW 

The measurement of CP violation in B s — > J/00 decays 
is based on a time-dependent analysis. As the J/00 final 
state is common to both B" and Bj?, direct decays of the 
B" to J/ip(f> are possible, as well as the decay where the 
B" first oscillates to Bg and then decays. The two paths 
to the final state interfere and give rise to the CP viola- 
tion. The main principle is to extract the time-dependent 
asymmetry 



A(t) = 



N(B° a ,t)-N(B° B ,t) 
N(B°,t)+N(B s ,ty 



(6) 



where N(B° sl t) (N(B°,t)) is the number of B s -> J/V>0 

decays at a given time t where Bg (B°) was originally 
produced. Using this we can identify the main compo- 
nents of the analysis, which are the reconstruction and 
selection of the candidates, the measurement of the decay 
time for each candidate, and finding out whether a given 
candidate was produced as B^ or B^. While the concept 
is rather simple, the fact that both J/0 and are spin-1 
particles means that the final state is a mixture of CP- 
eigenstates. This adds to the complexity of the task as 
one needs to separate also CP-odd and CP-even compo- 
nents through an angular analysis. Therefore in the final 
step, rather than forming an asymmetry, a maximum 
likelihood fit is typically used to extract information on 
the CP violation. The first two steps are rather straight- 
forward and we will discuss them shortly in this section; 
the others are those which require non-trivial work and 
therefore we discuss them in dedicated sections. 

In all three experiments the B s — > J/00 decays are 
reconstructed using the J/0 decays to two oppositely 
charged muons and the decays to oppositely charged 
kaons. The selection uses similar inputs in all three ex- 
periments. Events are typically selected by placing re- 
quirements on the momenta of reconstructed B s candi- 
dates and their daughters, the quality of tracks, and the 
quality of kinematic fits where all four tracks are con- 
strained to originate from a common vertex. LHCb, and 
to some extent also CDF, have particle identification ca- 
pabilities, which are used to separate kaons from more 
abundant pions. While the LHCb and DO experiments 
use sequential rectangular requirements, the CDF selec- 
tion employs a neural network to distinguish signal events 
from a combinatorial background. An important aspect 
of the selection at CDF is that the requirement on the 
resulting neural network output is chosen to minimize 
the expected uncertainties on the measured CP violation. 
This is achieved by performing the analysis on simulated 
experiments, where each simulated experiment uses the 
number of signal and background events corresponding to 
the given requirement on the neural network output. An 
example of the outcome of simulation for the standard 
model CP violation is shown in Fig. 1. The simulations 
which included different size of the CP violation provide 



the same picture. It is interesting to note that the re- 
sulting selection accepts more background that the pro- 
cedure used in early CDF analysis. The selected sample 
comprises of 6504 ± 85 B s — > J/00 signal events at CDF, 
3435 ± 84 signal events at DO and 757 ± 28 B s -> J/0>0 
decays at LHCb. The invariant mass distributions from 
all three experiments are shown in Figs. 1-3. It should 
be noted that LHCb selects only candidates with proper 
decay time larger than 0.3 ps while the Tevatron experi- 
ments accept all events independent of their decay time. 

The decay time for each candidate is extracted from 
the displacement of the B s decay vertex away from the 
primary vertex, which is reconstructed for each event 
separately. The uncertainty on the decay time can be 
estimated separately for each candidate and this can be 
exploited in the analysis. The typical resolution at the 
Tevatron experiments is about 80-100 fs while thanks to 
the higher boost the resolution at LHCb is of the order 
of 50 fs. The factor of 2 in the proper decay time res- 
olution between Tevatron and LHCb gives a significant 
advantage to the LHCb experiment in resolving the fast 
B = oscillations. 



IV. FLAVOUR TAGGING 

As mentioned before, the important part of the anal- 
ysis is to determine whether a given candidate was pro- 
duced as a B° or B". This task is performed by algo- 
rithms called flavour tagging and is one of the most chal- 
lenging parts of the analysis. 

At hadronic machines, most of the b quarks arc 
produced in pairs of opposite flavour (charge). After 
hadronization, in the events interesting for the measure- 
ment, they end up in two independent 6-hadrons. Given 
that many other particles are produced together with the 
fe-hadrons, one can treat the two 6-hadrons in the event as 
independent when considering their time evolution. This 
independence splits possibilities to tag flavour into two 
basic classes. The first one, called same side tagging, ex- 
ploits the fragmentation process by which the B s meson 
is created out of the b quark. The second class, called 
opposite side tagging, uses properties of the decay of the 
6-hadron which contains the other b quark. To fully char- 
acterize the performance of a flavour tagging algorithm, 
two quantities arc needed. The first one is the efficiency, 
e, which gives the fraction of the candidates for which a 
flavour tagging decision is made. The second one, called 
dilution, D, provides the information about the chance 
that the decision made is correct. Formally it is defined 
as 



D = 



N R - N, 



w 



N R + N 



w 



2Pr - 1, 



(7) 



where A R (N w ) is the number of correct (wrong) deci- 
sions and Pr, is the probability of having the correct deci- 
sion. Often the two performance quantities are combined 
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into a single value as eD 2 , which gives the effective statis- 
tics of the tagged sample. If we have N events with given 
flavour tagging performance, they will be equivalent to 
NeD 2 events with perfectly known production flavour. 
Typically the dilution is estimated on a candidate-by- 
candidate basis, and often used as additional informa- 
tion in the fits in order to increase the sensitivity to the 
oscillation behaviour of the B s . 

A. Flavour tagging algorithms 

The physics idea behind the same side tagging is rather 
simple. In order to produce a B[?, one needs to attach a 
strange quark to the bottom anti-quark in the hadroniza- 
tion process. The strange quark normally originates from 



a pair of quark and anti-quark and thus the remain- 
ing strange anti-quark has to end up in another hadron. 
With significant chance this other hadron is a kaon. If 
it is a charged kaon and the experimentalist succeeds 
in picking-up the correct corresponding track, then the 
charge of the track determines also the flavour of the B° 
at the production time. While the general idea is rather 
simple, the details of the hadronization are far from be- 
ing fully understood and therefore it is hard to develop 
such an algorithm. The selection of the tagging track can 
be done based on kinematical and/or particle identifica- 
tion information. From a point of view of kinematics, the 
track which carries the fragmentation partner will typi- 
cally have small transverse momentum relative to the B s 
and a large component of momentum in the direction of 
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the B s . The particle identification is a significant help as 
most of the particles produced in pp or pp collisions are 
pions and therefore a track positively identified as a kaon 
and being close to the B s has a large probability to be 
the right track. 

The opposite side algorithms exploit the decay of the 
second &- hadron in the event. Once its flavour is deter- 
mined, as the bottom quarks are produced in pairs of 
quark and anti-quark, the production flavour of the de- 
tected B s can be taken as opposite. 

The most common way of identifying the flavour of the 
other 6-hadron is to exploit scmilcptonic decays. They 
have a rather large branching fraction and provide a 
unique and clear experimental signature. The charge of 
the lepton is directly correlated to the flavour of the b 
quark in the decaying hadron. All experiments use de- 
cays which contain either an electron or muon. As the 
leptons are dominantly produced by the decays of heavy 
flavour (b and c) quarks, there is a large chance that the 
identified lepton originates in the 6-hadron decay. 

The second piece of information which can be used for 
opposite-side tagging is based on the b —> c — >• s decay 
chain of the b quark with the strange quark ending up 
in kaon. If charged, it contains information about the 
flavour of the decaying 6-hadron. Experimentally one is 
looking for a charged kaon, which does not point to the 
primary vertex. 

The other piece of information to use is more inclusive 
and utilizes a jet charge by CDF experiment and a sec- 
ondary vertex charge by DO and LHCb. The jet charge 
is calculated for each jet as 

Gjet = X^T™NN/X^ W NN, (8) 

where sums run over all tracks associated with the jet 
and qi is the charge of the i-th track, p l T is the transverse 
momentum of the given track, and w l NN is the probability 
that the track originates from the b- hadron decay [28]. 
The calculated jet charge is then used to decide on the 
production flavour of the B s meson. Three classes of jets 
are distinguished at CDF. The first class contains events 
with a secondary vertex within the jet. The second class 
is defined to contain events without a secondary vertex, 
but at least one track with a significant probability to 
come from 6-hadron decay and the third class contains 
all events failing the criteria for the first two classes. The 
reason for the distinction is that different classes have 
different performance and distinguishing them helps to 
optimize the overall performance. The secondary vertex 
charge uses a similar strategy, but instead of using the full 
jet, it exploits tracks assigned to the secondary vertex. 
The charge is calculated as 

Qsv = — , (9j 

where qi is the charge of the track and Wi is the weight. 
There are several options for weights which are in prin- 
ciple equivalent. The DO experiment uses as Wi, the lon- 
gitudinal momentum of the track along the direction of 



total momentum of the tracks assigned to the secondary 
vertex [29]. LHCb on the other hand uses as weight the 
transverse momentum of the track to the power of 0.4 
[30]. 

In each of the three experiments the different algo- 
rithms are executed separately and their outputs are then 
combined together. Typically in the combinations, oppo- 
site side algorithms are combined to a single decision and 
then the same side algorithm, if used, is handled as un- 
corrected to the opposite side decision. For combination 
of opposite side algorithms, CDF uses a neural network 
while DO and LHCb use a likelihood method. 



B. Calibration 

All three experiments make an effort to calibrate the 
mistag probabilities directly on the data. It is useful 
to treat the opposite- and the same-side tagging sepa- 
rately. As the opposite side tagging is independent of 
the reconstructed 6-hadron, it is possible to use the more 
abundant and B° mesons for the calibration. On the 
other hand, same-side tagging depends on the &-meson 
under study and therefore has to be calibrated using the 
B s mesons for the application discussed here. 



1. Opposite side tagging 

The easiest way of calibrating the opposite side tag- 
ging algorithms is to use fully reconstructed T!> ± decays. 
In the context of the measurement of the CP viola- 
tion in B s — > J/ip<j) decays, the most useful decay is 
B ± — > i/ipYJt. The advantage of the B ± is given by 
the fact that its reconstruction efficiency is rather high 
due to the lower number of tracks needed in the recon- 
struction and more importantly that it does not oscillate. 
Therefore the charge obtained from the decay products 
unambiguously identifies the flavour at the production 
time. As the production flavour is known for each candi- 
date, one can easily measure the probability Pr of having 
a correct decision and compare this with the probability 
estimated by the algorithm. The high abundance of the 
reconstructed B^ 1 signal also allows the measurement of 
the flavour tagging asymmetry by splitting the sample 
into B + and B _ . As an example, Fig. 4 shows the in- 
variant mass distribution of reconstructed B^ 1 signal at 
CDF and the dependence of the measured dilution on the 
estimated dilution. In this case, the ideal behaviour is a 
linear dependence with a slope of unity. The slope itself 
is later used in the fit for the CP violation in B s — > J/tpcj) 
to correct the estimated dilutions of the opposite-side 
taggers. 

The second option for the calibration of the opposite- 
side tagging is to use B° decays and measure the oscilla- 
tion pattern. It consists of measurement of the asymmc- 



6 



CDF II Preliminary, L = 5.2 fb 

7 1 4000 - 51 ' 920 +/ " 388 Si 9 nal Events 

w + 

> 

£ 12000- 




J/\|/ K + invariant mass [GeV/c ] 



g 
Q 

T3 

CD 
i 

ZS 

w 
03 
CD 



O 



2 q CDF Run II Preliminary L = 5.2 fb 



: B" events 
1 5^ Slope = 1.12 + 0.10 



1.0 
0.5 
0.0 
-0.5 7 

' 1 -8.o 




0.2 0.4 0.6 0.8 1.0 
OST Predicted Dilution 



FIG. 4. Example of the opposite side flavour tagging calibration using B± J/V'K* decays at CDF. On the left we show the 
J/ipK ± invariant mass distribution showing clear signal. On the right we show the measured dilution as a function of the 
estimated dilution for the B~ only. 



try 



E (*) = 



-^V mix(^) N Linmix(^) 
N mix (t) + iVunmix(t) 



(10) 



where Af m j x (JV unm i x ) is the number of B° candidates with 
the same (opposite) flavour at the production and de- 
cay time. The amplitude of the asymmetry directly en- 
codes the performance of the flavour tagging algorithm. 
While the achievable precision cannot compete with the 
decays, it provides a demonstration that particle os- 
cillations can be resolved and thus builds up the over- 
all confidence in the analysis. While each of the three 
experiments performed the B° mixing measurement at 
some point on the way to current work on CP violation 
in Bg, it is not widely used in the latest results due to its 
statistical limitations. 



2. Same side tagging 

A more difficult part is to calibrate the same side 
flavour tagging as it can be done only by using the B s 

itself. The principle is the same as using B° to calibrate 
the opposite side tagging. Measuring the B s oscillations 
using flavour-specific B s decays, the asymmetry ampli- 
tude provides direct information on the flavour tagging 
power. The difficulty is in the lower B s yield and the 
very fast oscillations, which make it hard to obtain a sig- 
nificant mixing signal. In addition, care has to be taken 
to properly describe the decay time resolution as its mis- 
modelling also affects the mixing asymmetry amplitude. 
On the other hand, one could argue that even if there 
is some decay time resolution mis-modeling, it is likely 



to be the same as in the case of decay B s — >• J/ipcf> and 
therefore does not pose a real issue for the analysis. 

To calibrate the same-side tagging, CDF uses the de- 
cays B s -> and B s ->■ D^^tt+tt^ . The D~ is re- 
constructed in decays to c/>7r - , K*K~ or 7r + 7r~7r - , where 
for the decay B s — > Dj7r ± 7r + 7r _ , only the D~ — > 07r ± 
is used. Altogether about 12900 signal events are re- 
constructed. The same-side flavour tagging algorithm 
estimates for each candidate dilution, which is taken 
into account in the B s oscillation fit. From the mea- 
sured amplitude of the B s mixing, CDF derives a sin- 
gle scaling factor for the estimated dilution of A = 
0.94 ± 0.15(stat) ± 0.13(syst) [31]. In this approach, one 
cannot correct the shape of the dilution distribution, but 
only adjust the overall average scale of it. 

The LHCb experiment plans to use the B s mixing to 
calibrate the same side tagging. The first study using 
data collected in 2010 allowed the measurement of the 
B s mixing frequency using the opposite-side flavour tag- 
ging [32]. While the measurement of the B|? mixing was 
possible, the same side flavour tagging performance is not 
sufficient to obtain a significant signal with the same-side 
flavour tagging only. As a consequence the first measure- 
ment of the CP violation in B s — > J/ip<j> from the LHCb 
experiment does not use the same-side flavour tagging. 

The situation at DO is different as there is no effec- 
tive way to trigger on the hadronic decays. Therefore 
it is impossible to obtain sufficient statistics in the fully 
reconstructed flavour-specific decays. There is the pos- 
sibility of reconstructing a large sample of semileptonic 
decays but due to the missing neutrino, the time reso- 
lution is significantly worse, which makes it difficult to 
obtain a significant B s mixing signal. All this makes cal- 
ibration of the same-side tagging at DO very difficult and 
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the DO experiment did not attempt to perform it up to 
now. It should be noted that while DO used the same- 
side tagging in previous rounds of the analysis, the latest 
analysis does not use same-side flavour tagging. 

C. Performance 

The performance of the flavour tagging at the CDF ex- 
periment is eD 2 « 1.2% for the opposite-side algorithm 
and eD 2 rts 3.2% for the same-side algorithm. As there 
is an overlap between the two taggers, it is not straight- 
forward to combine the two numbers into the single per- 
formance number. The DO experiment achieves for its 
opposite-side flavour tagging eD 2 w 2.5%. Finally the 
opposite-side flavour tagging performance at the LHCb 
experiment is eD 2 « 2.2%. 

It is interesting to compare the CDF and DO perfor- 
mances as the two detectors cover basically the same 
phase space region and work in the same environment. 
The factor two in the performance of the opposite side 
flavour tagging comes almost entirely from the better 
muon system, which has larger coverage at DO compared 
to CDF, but also lower misidentification rate. On the 
other hand, CDF benefits from limited particle identifi- 
cation, which significantly boosts the performance of the 
same-side flavour tagging. The main drawback of the 
same-side flavour tagging is in calibration, which requires 
significant effort for quite limited precision. 

The comparison between the Tevatron and LHCb ex- 
periments is more difficult as they cover different phase 
space regions. Despite that we can still safely say that 
LHCb strongly benefits from the excellent particle identi- 
fication, which boosts their opposite-side flavour tagging 
by identification of kaons coming from the b — > c — > s 
decay chain. Also, thanks to the forward geometry and 
design specific for flavour physics, the identification of 
electrons should be easier compared to the multi-purpose 
detectors at Tevatron. At the same time the same-side 
flavour tagging at LHCb is expected to perform worse 
than at CDF as the track density in the forward region 
is larger and thus it is more difficult to pick up the correct 
track. It should be also noted that in 2010, the LHCb 
experiment was taking data with the number of inter- 
actions per bunch crossing well above the design value, 
which means again a more difficult environment for the 
flavour tagging. 



V. FIT DESCRIPTION 



in detail is the description of the B s decays in the fit. It is 
rather instructive to read through the details in Refs. [33, 
34] . The CDF analysis is based fully on the description in 
Ref. [33]. As we will discuss in the following, the LHCb 
and DO analyses do not implement all the subtleties of 
the decay. 

As we have already discussed, for each candidate which 
the experiments reconstruct, the flavour tagging deter- 
mines whether the candidate was produced as a B|? or 

Bg. In the following, this is encoded in the variable £, 

which takes values 1 for the B°, -1 for the B°, and if 
the flavour tagging is unable to make a decision. The sig- 
nal probability function is given by the weighted average 
of the probability density functions for B° and B° which 
takes the form 

P s (t,p,£\D,a t ) = ±+^P(t,p\a t )e(p) 

+ ^—^-P(t,p\a t )e(p), (11) 

where P and P are the probability density functions for 
B° and Bj?. The quantities t and a t are the decay time 
and its uncertainty for a given candidate, p contains the 
measured decay angles in the transversity basis [35] and 
D is the dilution predicted for the given candidate. Fi- 
nally e(p) parametrizes the angular efficiency. It should 
be noted that each of the experiments has some mini- 
mum requirements on the momentum in order to be able 
to reconstruct a track and this requirement reflects in 
non-uniform angular efficiencies. This minimal require- 
ment is a consequence of the geometry of the detector 
and is in principle very hard to avoid. 

The decay time and angular distribution of the decay 
B s — > J/00 is given in Refs. [35] as 



pcx|A | 2 r + .f 1 (p) + |A||| 2 r + / 2 (p) 

+ \A ± \ 2 T-f 3 (p) + \A ll \\A ± \U + h(p) 

+ |A)P|||cos(<5||)T+/ 5 (a) 

+ \A \\A ± \V+f 6 (p), (12) 



It is time to get to the heart of the analysis, which is 
a maximum likelihood fit. We are going to skip most of 
the details of the background description and refer the 
reader to the original work of the experiments. In short, 
the background is described using a phenomenological 
description derived mostly using data events in the B s 
mass sidebands. The important part we want to discuss 



where |Ao|, \A\\\ and \A±\ are three polarization ampli- 
tudes and the functions fi(p) describe the angular distri- 
butions [35]. The first three terms describe amplitudes- 
squared while the other three terms describe interferences 
between the three amplitudes. The description of B° is 
obtained by substituting U+ by U~ and V+ by V- . The 
functions 7±, U± and V± provide the time dependence 
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and take the form 



e- rt x 



cosh(ALi/2) tcos( 



± rj sin((f)l ) sin( Am s t) 



)sinh(Art/2) 
(13) 



U± = ±e rt x [sin((5j_ — cos(Am s t) 

— cos(<5j_ — cos(0s^^) sin(Am s i) 

T cos((5_l - <5||) sin(^ / ' / ' ) sinh(AR/2)j (,14) 

V± = ±e~ rt x [sin((5 ± )cos(ATO s <) 



— cos(<5j_) cos(0s ) sin(Arn s t) 
T cos((^)sin(0; W )sinh(Art/2) 



(15) 



In this, 5u and <$j_ are strong phases between amplitudes 
and r\ is 1 for B° and —1 for B". 

There are a few interesting points to note which are 
different in this case compared to the analogous analysis 
of the CP violation in B° — > J/ipK®. First if we consider 
the case without flavour tagging, which corresponds to 
the case of £ = for each event, the terms sin(Am s t) 
cancel out, but there are several other terms which are 

sensitive to <pf^"^. This comes from the fact that both J/0 
and are spin 1 particles and thus we deal with a mix- 
ture of CP-even and CP-odd final states, which interferes 
and the non-zero AI\ As a benefit from this complexity, 
the system provides an additional sensitivity to the CP 
violation from the interference terms, which is available 
even without flavour tagging. The second point to note 
is that even if we put the CP violation to zero, equivalent 

to sin(0s ) = 0, the interference between CP-even and 
CP-odd provides sensitivity to the B s mixing frequency 
[33] which can be exploited by experiments like ATLAS 
and CMS to perform a measurement also without access 
to the hadronic B s decays. Finally, in the case of no CP 
violation one is sensitive to the strong phase S± only with 
the flavour tagging, while without the flavour tagging, 8± 
is unaccessible. 

A delicate issue is the question of a possible s-wave con- 
tribution to the reconstructed B s signal. While each ex- 
periment has rather tight selection on the invariant mass 
of the kaon pair around the world average mass, decays 
like nonresonant B s -> J/0K+K - or B s -> J/0f o (98O) 
with f (980) — > K + K~ can contribute as well. Original 
estimates of the branching fraction relative to B s — > J/00 
decay 



P 



fo/<t> 



B(B S J/# (980)) B(f (98Q) 



B(B S — > J/^0) B{4> -> K+K") 



(16) 



yielded values around 0.2 [36]. While this estimate did 
not include the effect of the selection on the kaon pair 
invariant mass, it was argued that neglecting the s-wave 
contribution can bias the result for the CP violation [37]. 
Recent observation of the decay B s — > J/0f o (98O) and 
measurement of Rf /$ to be about 0.25 [38-41] further 



support the necessity to take the s-wave contribution into 
account in some way. With a typical selection on the in- 
variant mass of kaon pairs this would translate to about 
a 1.5% contribution from the B s — > J/0f o (98O) decay in 
the selected sample with rather large uncertainties due to 
badly known f (980) branching fractions. At this stage 
the most complete treatment is done by the CDF experi- 
ment and uses the formalism from Ref. [33] . The analysis 
incorporates an additional amplitude yielding four more 
angular terms, one for the s-wave amplitude squared and 
three for the interference between the original amplitudes 
of the B s — > J/00 decay and the s-wave amplitude. The 
implementation treats the invariant mass of the kaon pair 
as an unobserved variable. While the s-wave complicates 
an already complex analysis further, it might provide an 
additional benefit in helping to resolve ambiguities in the 
value of the CP-violating phase. In contrast to the CDF 
treatment, neither DO nor LHCb implements at this stage 
the s-wave contribution into the fit. As the presence of 
the significant s-wave contribution would introduce some 
asymmetry in the distribution of the kaon angle, exper- 
imentally it is possible to check for its presence. This 
was for the first time seen in the B° — » J/0K* decays at 
the Babar experiment [42]. The DO experiment performs 
such a check by inspecting the forward-backward asym- 
metry in the kaon angle distribution in five different in- 
tervals of the kaon pair invariant mass (see Fig. 5). From 
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FIG. 5. The forward-backward asymmetry of the positive 
kaon angular distribution across the invariant mass of the 
kaon pair. In the absence of the s-wave, one expects a flat 
dependence, while in the presence, the dependence becomes 
non-trivial. 

this they conclude that no significant s-wave is present 
and neglect it in the rest of the analysis. On the other 
hand, LHCb neglects s-wave contribution in the fit, but 
evaluates the systematic uncertainty using information 
on the s-wave derived by the CDF experiment in their 
full fit. Thus while none of the experiments has evidence 
for non-zero s-wave contribution, the CDF and LHCb ex- 
periments include such a possibility to the uncertainties, 
while DO remains more aggressive and does not assign 
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CDF Run II Preliminary L = 5.2 fb" 1 




2Aln(L p ) 

FIG. 6. The map between actual coverage and the likelihood 
ratio for the CDF measurement. The continuous line shows 
ideal Gaussian behaviour, the full histogram the default set 
of generated experiments and the dashed histograms show 16 
alternative sets. 



any uncertainties for the possible s-wave contribution. 



VI. STATISTICAL ISSUES 

While the interference between CP-odd and CP-even 
amplitudes helps in the determination of the CP violat- 
ing phase (j>l^^ , it also introduces some non-trivial sta- 
tistical issues. As is apparent from Sec. V that the prob- 
ability density function contains several periodic func- 
tions, resulting in some symmetries if we consider the 
B s — > J/4>4> decays without the s-wave contribution. Ne- 
glecting the s-wave contribution, the system is invariant 
under simultaneous transformation ^ — > tt — tfil^^ , 
Ar -> -Ar, S ± -> 7T - S±, and 6\\ -> 2ir - 8\\. As a 
consequence there are two equivalent solutions which in 
the case of small statistics are not well separated. This 
fact itself makes minimization of the likelihood a difficult 
task. The issue of symmetries can appear not only for 

$ [ , but also for strong phases and, if their true values 
are close to the symmetry point, their extraction is again 
non-trivial. 

As was discussed before, there is a possibility to ex- 
tract information on the CP violation even without 
flavour tagging. In this case, the situation becomes 
even more difficult. The single symmetry of the flavour 
tagged case turns to two independent symmetries, which 
are -> , 5± S± + tt) and (Ar -> 

— Ar, <f>f^^ — >■ (jyj^"^ — tt). The consequence is the ex- 
istence of four solutions compared to two in the flavour 
tagged case. An additional complication arises from the 



fact that the strong phase 5x appears always in a product 
with sax^a ) . As a result, in case of no CP violation 
there is no sensitivity to <5j_, but if the sensitivity to CP 
violation is small, the fit tends to bias the result as by 
increasing the CP violation, the fit gains 8x as an addi- 
tional parameter available to describe the statistical fluc- 
tuations. Moreover the bias is non- linear and decreases 
with increasing true CP violation. 

It follows from the symmetries that there is a danger 
of non-Gaussian behaviour of the likelihood, which to 
some extent depends on the statistics and the true val- 
ues of the parameters. If the true values are close to the 
symmetry points, more statistics are needed to clearly 
resolve those. Given the importance of the measurement 
for putting bounds on new physics, it is important to 
make sure that any non-Gaussian behaviour is properly 
taken into account. In order to achieve this, the experi- 
ments resort to a frequentist treatment based on the like- 
lihood ratio ordering suggested by Feldman and Cousins 

[43]. To construct confidence level regions in the (j//^^- 
Ar plane, the procedure is to evaluate for each point, 

the ratio of likelihoods between the fit with (f>l and Ar 
fixed to specific values, and the fit where they are allowed 
to float. The likelihood ratio is then compared to the set 
of simulated experiments. For each point a p- value is 
obtained as a fraction of the number of simulated exper- 
iments which have a likelihood ratio larger than the one 
observed in the data. Connecting points with the same 
p- value yields the corresponding confidence-level region. 

The difficulty in the procedure is that there are many 
parameters involved, for which we do not know the true 
values, but only estimates from experiments which could 
be from the analysis itself or some measurement external 
to the analysis. To deal with this, CDF generates one set 
of experiments using the values of all parameters except 
<pg^ and Ar from the global minimum of the likelihood 
together with 16 alternative sets where all parameters 
are chosen randomly from a ±5cr hypercube around the 
global minimum. An example of the map between the 
actual coverage and the likelihood ratio for the CDF ex- 
periment is shown in Fig. 6. While the procedure does 
not guarantee the exact coverage of the derived contours, 
it is assured that there is no undercoverage. The FHCb 
experiment employs a similar procedure, but from the 
available information it is not fully clear to what extent 
they vary the input parameters in the simulated exper- 
iments. The DO experiment decides to take a different 
path and rather than going through a full frequentist 
treatment, they constrain the strong phases to the val- 
ues measured in the B° — > J/-0K* decay. This point is 
rather controversial with theoretical arguments support- 
ing it presented in Ref. [44] , but the argument is generally 
not fully accepted. From the experimental point of view, 
the constraint effectively removes some symmetries and 
better separates two minima. This results in a likelihood 
which is closer to the Gaussian shape, but still needs a 
small adjustment, which is performed in a similar way to 
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CDF and LHCb. 

To finish the discussion on the statistical issues, a note 
on the importance of the flavour tagging and the time 
resolution is in order. As we discussed, there are two 
pieces of information about the CP violation in the anal- 
ysis. The first one is in the interference between CP-even 
and CP-odd amplitudes and this one does not require the 
resolution of the B s oscillations. Therefore the time res- 
olution for this part is not critical. On the other hand, 
the importance is driven by the size of Ar. With larger 
Ar, the importance increases. As a consequence, there 
is some correlation between the uncertainty on (p 3 /^^ and 
the value of Ar extracted by a given experiment. The 
second part of the sensitivity comes from resolving the 
B s oscillations and here the flavour tagging performance 
and the time resolution are crucial. Here LHCb has the 
clear benefit of better time resolution compared to the 
Tevatron experiments. On the other hand, the flavour 
tagging performance is behind the CDF experiment at 
this stage and smears out part of the benefit from their 
better decay time resolution. 



VII. RESULTS 

Typically in each case two different fits are performed. 
One in which no CP violation is assumed and the values 
of physics parameters such as mean lifetime, decay width 
difference, and amplitudes are measured, and the second 
fit in which constraints on CP violation are derived. The 
main result of the analysis is given as bounds in the Ar- 
( j > J/1><t> pi ane _ 

In Fig. 7 we show the confidence level contours in 
the AT-cj) 3 /'^"^ plane from the CDF and DO experiments 
and in Fig. 8, we show the contours derived by the 
LHCb experiment. The reader should be aware that 
CDF uses a different convention than DO and LHCb 
with — 2(3 s = (j) 3 J^^ . As the standard model is a spe- 
cial case, each experiment derives consistency between 
the data and the standard model. The consistency is 
characterized by the p-value, which is 44% at CDF cor- 
responding to about 0.8 standard deviations and LHCb 
finds a p-value of 22% which corresponds to about 1.2 
standard deviations. Those tests provide an answer to 
the question of whether both Ar and <pl ^ are simul- 
taneously consistent with the standard model. DO does 
not evaluate an answer to this question, but from Fig. 7 
we can see that the agreement is equivalent to little more 
than one standard deviation. It is worth noting that 
while CDF and LHCb allow for two solutions and there- 
fore have bounds which are symmetric, DO, by constrain- 
ing the strong phases, allows only for one solution. The 
fact that there are still two solutions in their result is 
an artefact of additional approximate symmetries of the 
problem. Some difference between those approximately 
symmetric solutions is seen, but the statistics are not 
sufficient to decide between them. Before moving on, we 



come back to the point of deriving constraints without 
flavour tagging. In Fig. 9 we show constraints from the 
untagged analysis of the CDF experiments together with 
the result from Fig. 7. As one can see the size of the 
contours is not very different between the two analyses 
and the main help from the flavour tagging is in remov- 
ing two out of the four solutions. The slight shift of the 
two results is due to the difference in the importance of 
each candidate for two analyses. The untagged analysis 
was performed by LHCb but no useful constraint could 
be derived with the current statistics [45]. DO also per- 
formed a fit without flavour tagging and obtained a result 
consistent with the tagged fit. 

What is more interesting for some people is simply 
the value of the CP-violating phase (f> 3 J^^ , rather than 
the allowed region in two-dimensional space. To ob- 
tain this, CDF and LHCb basically repeat the proce- 
dure used for the two-dimensional case where Ar is 
also treated as an unimportant parameter and maxi- 
mize likelihood over it. The procedure yields (f> 3 J^^ e 
[-3.1, -2.16] U [-1.04, -0.04] at 68% confidence level and 

0- s W e [-7r,-1.78]U[-1.36,0.26]U[2.88,7r] at 95% confi- 
dence level at the CDF experiment. At LHCb the allowed 

regions are <p 3 J^" t ' e [—2.7, —0.5] at 68% confidence level 

and t^f^ e [-3.5,0.2] at 95% confidence level. The 
DO experiment does not provide a result in this way, 
but their values can be translated into approximate one- 
dimensional results as the correlation between Ar and 
^j/i/>0 j g reasona biy small, about —18%. The intervals 

they obtain are <f> 3 J^^ G [—1.12, —0.38] at 68% confidence 

level and <^ /W € [-1.65,0.24] U [1.14,2.93] at 95% con- 
fidence level without taking into account any correction 
for non-Gaussian behaviour. Also in the case of one- 
dimensional tests, there is a reasonable agreement be- 
tween the standard model and data. 

As the CDF analysis implements also an s-wave contri- 
bution, it was possible to check also how large an effect 
it introduces. First, in Fig. 10 we show the likelihood 
profile for the amount of s-wave contribution and the in- 
variant mass of the kaon pair, which is not used in the fit. 
From the full angular fit the obtained s-wave fraction is 
consistent with zero. From the likelihood profile one can 
set a Bayesian upper limit of about 7% at 95% credibility 
level for the s-wave fraction within the selected sample. 
One should note that this fraction is selection-dependent. 
As a cross-check one can also check whether the kaon pair 
invariant mass is consistent with the composition found 
in the full angular fit. We show the distribution together 
with the fit where the s-wave fraction is fixed to the best 
value from the angular fit in Fig. 10. As one can see, the 
model fits the data well and therefore adds additional 
confidence into the treatment adapted by the CDF ex- 
periment. Finally also a check for the effect of the s-wave 
can be made. In this CDF compared likelihood contours 
between fits with the s-wave allowed to float, and a fit 
with the s-wave fraction fixed to zero, and there is almost 
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FIG. 8. The bounds in the Ar-f/C plane obtained by 
LHCb experiment. 



CDF Run II Preliminary L = 5.2 fb" 1 




-V.5 -1' ' -0.5 0.5 1 L'5 
p s (rad) 



no visible difference between them. For completeness we 
show results for the lifetime and Ar in Fig. 11. They 
are typically obtained in a fit which assumes no CP vi- 
olation and should therefore be treated in the context of 
the standard model. For results on the amplitudes we 
kindly refer the reader to the original work of the three 
experiments. 

The important question is what the measurements tell 
us about the validity of the standard model and potential 
new physics contributions. In all three experiments, the 
evaluation of the consistency with the standard model 
is available and in all three cases there is no significant 
departure from the standard model. On the other hand, 
solutions obtained in all three experiments go in the same 
direction from the standard model which suggests that 
there might be some effect of new physics. While the 



FIG. 9. The comparison of bounds in Ar-/3 S plane obtained 
by the flavour-tagged (full lines) and the flavour-untagged 
(dashed lines) analysis from the CDF experiment. 



combination would be interesting, unfortunately with the 
information publicly available it is not possible to com- 
bine the results. Moreover with the current precision it is 
not possible to exclude any CP violation provided it lies 
in the half-plane in which all the results are. Therefore 
practically any constraints on new physics models from 
measurements presented here will be rather weak. This 
might not necessarily be the case when combined with 
measurements of other quantities, but such a discussion 
is beyond the scope of this review. 
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FIG. 10. The likelihood profile over the s-wave fraction from the full angular fit at CDF (left). The invariant mass of the kaon 
pair together with the fit from the CDF experiment (right). The fit itself includes the s-wave with fraction fixed from the full 
angular fit. 
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FIG. 11. The comparison of the measured mean lifetimes together with the B° lifetime which is predicted to be the same with 
high accuracy (left) and the decay width difference Ar (right). 



VIII. PROSPECTS 



What to expect in the near future? There are good 
prospects to see new results within a couple of months. 
The Tevatron collider runs well and both CDF and DO 
expect to collect about 10 fb -1 of data by the end of 
September 2011 when the Tevatron will terminate its op- 
eration. While some improvements would still be possi- 
ble, we do not expect a large gain beyond the increased 
statistics. In the meantime, LHC performance is excel- 
lent with LHCb on track to collect about 1 fb -1 of data 
by the end of this year. Given that the first analysis was 
performed on only 37 pb -1 this gives good prospects for 
a large statistical improvement. Moreover as LHCb is an 



experiment which started to take data only last year it 
is reasonable to expect some improvements which could 
help to constrain the CP violation in B s mixing. Finally 
while the ATLAS and CMS experiments did not present 
results in this area, measurement of the CP violation 
in B s — > J/tp'P decay is in their plans with first results 
expected in the near future. To conclude, in the near 
future the precision might be sufficient to see significant 
signal of large CP violation in B s — > i/ip(f) decay or con- 
strain it to values close to the standard model. If it is 
constrained close to the standard model, then question of 
the suppressed standard model contributions will become 
important and could limit capability of bounding new 
physics until progress on the understanding suppressed 
standard model contribution is made [46-48]. 
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